Membrane clustering and the role of rebinding in biochemical 

signaling 

Andrew Mugler^, Aimee Gotway Bailey^, Koichi Takahashi^, Pieter Rein ten Wolde^' 

^FOM Institute AMOLF, Science Park 104, 1098 XG Amsterdam, The Netherlands 
^American Association for the Advancement of Science, Washington, DC 20005, USA 
^Advanced Sciences Institute, RIKEN, 1-7-22 Suehirocho, Tsurumi, Yokohama, 230-0045, Japan; 

The Molecular Sciences Institute, Berkeley, 2168 Shattuck Avenue, Berkeley, CA 94704; 
Institute for Advanced Biosciences, Keio University, 252-8520 Fujisawa, Japan 



Abstract 

In many cellular signaling pathways, key components form clusters at the cell membrane. Although 
much work has focused on the mechanisms behind such cluster formation, the implications for downstream 
signaling remain poorly understood. Here, motivated by recent experiments, we study via particle- 
based simulation a covalcnt modification network in which the activating component is either clustered 
or randomly distributed on the membrane. We find that while clustering reduces the response of a 
single-modification network, clustering can enhance the response of a doublc-modificatiou network. The 
reduction is a bulk effect: a cluster presents a smaller effective target to a substrate molecule in the bulk. 
The enhancement, on the other hand, is a local effect: a cluster promotes the rapid rebinding and second 
activation of singly active substrate molecules. As such, the enhancement relies upon frequent collisions 
on a short timescale, which leads to a diffusion coefficient at which the enhancement is optimal. We 
complement siunilatiou with analytic results at both the mean-field and first-passage distribution levels. 
Our results emphasize the importance of spatially resolved models, showing that significant effects of 
spatial correlations persist even in spatially averaged quantities such as response curves. 
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Introduction 



Although often modeled as well-mixed chemical reactors, cells are highly spatially heterogeneous entities. 
Beyond merely providing the blueprint for space-dependent processes such as division or patterning, spatial 
heterogeneities in cellular components are frequently exploited by biochemical networks as additional de- 
grees of freedom in signaling computations [T] . The most direct example is compartmentalization, in which 
the same chemical component initiates different phenotypic responses depending on where it is localized 
within the cell [21 [3] ■ In a similar way, the localization of signaling components via scaffolding proteins 
has effects on signal amplification that depend nontrivially on the surrounding chemical conditions [4]. In 
fact, the colocalization of just two components can have a dramatic response on the amplification properties 
of a enzyme-driven reaction network 0. Even in spatially uniform systems, spatial correlations between 
individual molecules can have significant effects on the mean response [3]. 

One of the most actively studied areas in which spatial heterogeneity is emerging as a key factor is signal 
transduction at the cell membrane. In addition to imposing a quasi-two-dimensional geometry, the membrane 
plays host to a large diversity of cellular components, the interactions among which give rise to a complex 
spatial organization f7^. A central theme of recent work in this field has been the prevalence and role of 
membrane clusters — groups of colocalized molecules that often participate in the detection of external signals 
and subsequently drive responses within the cell. Perhaps the most well known example occurs in bacterial 
chemotaxis, in which clusters of receptors detect external ligands, triggering messenger molecules to modulate 
the activity of flagellar motors [8] . Evidence for clustering in eukaryotic cell membranes has recently been 
observed as well: data from immuno-electron microscropy and single-molecule fluorescence experiments 
[10] suggest that Ras, a protein implicated in a variety of phenotypic responses including oncogenesis, forms 
membrane clusters on which the efficacy of its downstream signaling critically relies. Clustering may also 
be connected to the partitioning of the membrane itself into spatially segmented domains [Ullll], e.g. via 
interaction with the cytoskeleton fT^ or formation of so-called lipid rafts [Hj. 

Although much modeling work has been done to elucidate the possible mechanisms by which clusters form 
[ISl [ini E] , insights on the role that clustering plays in downstream signaling remain largely speculative. 
Therefore a driving goal of the present study is to quantitatively understand, for a spatially resolved model 
of a canonical signaling network, the effect that clustering has on the input-output response. Recognizing its 
ubiquity in the systems in which clustering is observed [IS] , we focus on a covalent modification network often 
called the push-pull network, in which a substrate is alternately activated and deactivated by two antagonistic 
components (Fig. [TJ\). For example, in bacterial chemotaxis, the kinase CheA and the phosphatase CheZ 
phosphorylate and dephosphorylate the messenger protein CheY, respectively; CheA and CheZ therefore 
play the roles of the two antagonistic components, while CheY plays the role of the substrate. 

Moreover, focusing on a push-pull network naturally permits extension to a double-modification process 
(Fig. [l^), which is a critical step in many membrane signaling pathways. In eukaryotic cells, for example, 
active Ras molecules at the membrane initiate a mitogen-activated protein kinase (MAPK) cascade within 
the cell, each layer of which consists of a dual phosphorylation cycle. In general, dual phosphorylation can be 
carried out by one of two mechanisms: in a processive mechanism, an enzyme modifies both phosphorylation 
sites on a substrate molecule before releasing it; in a distributive mechanism, on the other hand, the enzyme 
must release the substrate after modification of the first site, before rebinding and modifying the second 
site. It has been shown experimentally that key kinases [El [20] and phosphatases [21] in the MAPK cascade 
act in a distributive manner, making the rebinding process of critical importance. Therefore a second focus 
of the present study is to investigate the interplay between clustering and rebinding in determining the 
input-output response of a distributive push-pull network. 

We provide a spatially resolved description of the system by performing particle-based simulations on 
a lattice. In parallel, we gain important physical intuition from analytic results derived at both the mean- 
field and first-passage distribution levels. We find that the input-output response of the network changes 
depending on whether the activating component is clustered or randomly distributed on the membrane 
(Fig. [ip). Specifically, clustering reduces the response of a single- modification network, while clustering 
can enhance the response of a double-modification network. We demonstrate that the reduction is a direct 
consequence of the fact that a cluster presents a smaller effective target to a substrate molecule in the bulk. 
By investigating in detail the stochastic nature of the rebinding process, we discover that the enhancement 
has an entirely different origin: clustering promotes the rapid rebinding and second activation of singly active 
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Figure 1: Schematics of reaction networks and spatial arrangement of molecules. (A) The single-modification 
network, in which a substrate S is activated and deactivated by components Ea and Ed, respectively. (B) The 
double-modification network, in which the substrate can be doubly activated. (C) While S and E^ molecules 
diffuse freely in the cytoplasm, Ea molecules are fixed on the membrane in either a random (left) or clustered 
(right) configuration; right panel depicts a rebinding event, in which a singly activated S* molecule rapidly 
returns to the Ea cluster to become a doubly activated S** molecule. 



substrate molecules (Fig. [Tp). We find that such a rapid effect is only exploited when both the activating 
and deactivating components are sufficiently free to react, such that ultrasensitive networks 22 , in which 
one or the other component is saturated by the substrate, do not exhibit the enhancement. 

What's more, we find that the enhancement with clustering is more pronounced when the diffusion 
coefficient is large. Underlying this observation is a fundamental advantage that clustering affords in a 
collision-dominated regime: while diffusion may be high enough to prevent a substrate molecule from rapidly 
rebinding an isolated enzyme molecule, it may be insufficient for the substrate molecule to escape an entire 
cluster. Clustering thus prolongs the possibility of rapidly rebinding, effectively compensating for low as- 
sociation rates of individual molecules. Of course, this advantage reaches a limit — at infinite diffusion all 
spatial arrangement is forgotten. We are thus led naturally to a value of the diffusion coefficient at which 
the enhancement is optimal. 

Together our results provide a quantitative picture of the nontrivial effects that membrane clustering has 
on biochemical signaling, for a network that plays a critical role in systems in which clustering has been 
experimentally observed. More broadly, our results demonstrate the crucial role that spatial correlations play 
in cellular function, and the associated importance of considering spatial resolution in biophysical models. 

Methods 

We consider both a single- and a double-modification push-pull network in which the activating enzyme is 
localized to the membrane (Fig.[T]). To understand the effect of clustering, we compare the situation in which 
activating enzyme molecules are arranged randomly on the membrane to that in which they are localized at 
the same surface density to clusters of size N (Fig.jlp). Because we are interested in the effect of clustering 
on downstream signaling, and not in the dynamics of cluster formation on the membrane, we take activating 
enzyme molecules to be fixed. Substrate and deactivating enzyme molecules diffuse freely in the cytoplasm 
with diffusion coefficient D. 
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Chemical reactions, input-output relation, and sensitivity 

The single-modification network (Fig. is described by the reactions 

Ea + S^EaS^Ea + S*, (1) 

Ed + S* ^ EdS* ^Ea + S. (2) 

Here S and S* denote the substrate in its inactive and active forms, respectively. Activation is catalyzed 
by the activating enzyme Ea, which first forms a complex before releasing the substrate in its active state; 
deactivation is performed similarly by the deactivating enzyme E^. The double- modification network (Fig. 
[T|3) prescribes additional reactions identical to Eqns. I][2 except with 5" and S* replaced by S* and S** , 



respectively. We restrict our analysis to networks whose first and second modification processes are identical 
(i.e. the rates fci, fc2, . . . , fcg describing the first modification also describe the second). Furthermore all results 
in this paper assume negligible back reactions: k2 = = 0. 

The input of a push-pull network is typically defined as either the catalytic rate or the concentration 
of the activating enzyme. In the context of signaling via membrane clusters, the former definition finds 
natural justification: in chemotaxis, for example, the catalytic activity of the kinase is typically set by the 
time-averaged ligand occupancy of the receptor cluster [53]. We therefore take as the input parameter the 
catalytic rate fca, scaled by its counterpart kg for the deactivating enzyme: x = k^/k^. The output is naturally 
defined as the relative activity of the substrate, i.e. the fraction (f> = {[S*]/[S]t, [S**]/[S]t} for the single- 
or double- modification network, respectively. Here [S]t denotes the total substrate concentration, including 
all activity states and complexes in which it is involved. The fraction is a mean quantity, averaged over 
both time (post-equilibration) and space. Often we will normalize the output by (/imax: the maximum output 
assuming a deterministic, well-mixed description (Appendix |b| . The input-output curve (j){x) takes on a 
characteristically sigmoidal shape (e.g., Fig. [2]), becoming particularly sharp, or ultrasensitive, when the 
enzymes are saturated by the substrate |22) . 

In a deterministic, well-mixed description, in which rate equations determine the dynamics, the steady- 
state input-output relation is completely specified by the reaction rates and the conserved total concentrations 
of substrate [S]t and enzymes [Ea]T and [Edlr- In particular, for both the single- and the double-modification 
networks, one may write the input-output relations entirely in terms of the dimensionless parameters (e.g., 
see Appendix [b]) 

_ [EdW . _ fc4 _ K _ [EaW 

[EalT ki [b\T [t>\T 

where K = k^/k^ is the Michaelis-Menten concentration of the deactivation process, and [Ealr is N divided 
by the volume. The first two parameters determine the bias of the network toward deactivation; a = /? = 1 
therefore corresponds to a symmetric network, in which activating and deactivating enzymes have equal 
concentrations and association rates to the substrate. The last two parameters characterize the sensitivity of 
the network: in the zero-order (or ultrasensitive) regime, the substrate saturates the enzymes and operates 
far beyond the Michaelis-Menten concentration ({e, 7} <C 1); while in the Zmear regime, both substrate and 
enzymes operate in the linear regions of their response curves ({7"^, £7""'^} ^ !)• 



Spatial lattice model 

Although much can be understood from deterministic, well-mixed descriptions of push-pull networks |221I24I 
US] , we expect (and indeed will show) that spatial effects introduced by clustering will significantly influence 
the signaling properties of the system, even at the level of the mean response. We therefore perform spatially 
resolved simulations with excluded volume interactions by introducing a regular three-dimensional lattice. 
We make the approximation that all molecules have equal diameter i, and we let this diameter define 
the lattice spacing, such that molecules neighboring each other on the lattice are in contact. Clustered 
molecules are placed in contact in a square arrangement on the membrane, which is natural given the lattice 
implementation; we have tested that the results are not significantly affected by instead placing molecules in 
an arrangement that is circular (up to the lattice resolution). The membrane comprises the x-y plane and 
extends for a length L in each direction, beyond which periodic boundaries are imposed. The cytoplasm has 
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depth with reflective boundaries at both the membrane {z = 0) and the farthest point from it {z ~ Z). 
Appendix [A] provides a detailed account of how reactions and diffusion are implemented on the lattice, in 
particular such that detailed balance is obeyed. 

Spatial resolution introduces new parameters into the problem beyond those of the well-mixed system 
(Eqn. [3]), which are captured by the following dimensionless quantities. In addition to the cluster size A'^ one 
has 

The quantity l/AirS = ki/4TT£D is the ratio of the activating enzyme's intrinsic association rate ki, which 
is the association rate given that the molecules are in contact, to the corresponding diffusion-limited value 
Att£D; as such S captures the strength of diffusion relative to association. The parameter /i describes the 
surface density of activating enzymes and represents the introduction of the lengthscale L, beyond which 
the problem is periodic. The dimensionless length ^ reflects the fact that the membrane breaks the x-y-z 
symmetry, introducing a second lengthscale Z for the cytoplasmic depth. 

Finally, we identify a natural timescale as the time to diffuse approximately one molecular diameter 
and use it to define a dimensionless time: r = tl{P /D). Recovery of time-dependent quantities such as 
first-passage times requires specification of the ratio ^^/-D, while for steady state computations the physical 
values of the molecular diameter and diffusion coefficient drop out completely; only specification of the 
dimensionless parameters is required (x, N, and Eqns. 3]|4). 



Parameter selection 

The Results section discusses in detail the effects of varying the parameters that govern network symmetry 
(a, /3), sensitivity (7, e), and diffusion {5). In all results establishing network characteristics (Figs. [2j [sj [5] 
and|6]), the surface density of activating enzymes (/i) and the cytoplasmic depth {(,) are set using estimates 
from experimentally studied systems. In eukaryotic cells, clustered Ras has been measured to occupy a 
membrane surface fraction of /i < 1% |26| . A similar value arises in bacterial chemotaxis: the 'long' form 
of CheA (the form which both associates to receptors and phosphorylates CheY) is present in roughly 4500 
copies per Escherichia coli cell 127] ; taking a cell surface area of 6 p,w? and a typical protein diameter of 4 
nm [28], one obtains ji ^ 0.012. We therefore take /i — 0.01. The cytoplasmic depth is a measure of the 
maximum distance from the membrane that a molecule diffuses before encountering a reflective barrier (or, 
at the most, half the distance to the opposite membrane). In bacteria, this distance is upper bounded by half 
the smallest cell lengthscale, or ~500 nm. In eukaryotic cells, this depth is instead dictated by the presence 
of large organelles near the membrane. An extreme upper bound can be obtained by noting that organelles 
comprise roughly half the cell volume |29|, and imagining they are spherically packed in the center of a 
spherical cell of radius i? ~ 5 /xm implies a maximum depth of Z = [1 — (l/2)^/'^]i? ~ 1000 nm. Organelles 
are, of course, more loosely distributed within the cell, such that a depth on the order of 100 nm might be 
more realistic. We therefore take Z = 100 nm, which for a molecular diameter of 4 nm gives C = 25. 



Results 

We begin by presenting and explaining the main difference between the single- and double-modification 
networks: clustering reduces the response of a single-modification network, while clustering can enhance 
the response of a double-modification network. The magnitude of each effect scales with the cluster size 
N (Fig. [2|. The reduction for single- modification networks is generic, persisting with changes in network 
symmetry (a, sensitivity (7, e), and diffusion (6). The enhancement for double- modification networks, 
on the other hand, is more specific, occurring in deactivation-biased linear-sensitivity networks with high 
diffusion; subsequent results in this section will explain this specificity. 

Figure [2] also illustrates more generally the effect of localizing the activating enzymes to the membrane 
by comparing the spatially averaged response to the response in the well-mixed case (see Appendix |b]) . As 
seen in Fig. [2]4., localization reduces the maximal response of a single-modification network (compare the 
'well-mixed' curve to the 'random' curve). Such a reduction was seen in previous work [S] and is the result 
of the concentration gradients that form due to the asymmetric localization of activating and deactivating 
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Figure 2: Input-output response of a single- and double-modification network. Cluster size TV is varied at 
constant surface density fj, = 0.01 and depth ^ = 25, leaving the response unchanged with N in the random 
configuration. Well-mixed curve is established according to rate equations in steady state (Appendix |B| . 
Curves are normalized by the maximal value of the well-mixed response, 0max- (A) A symmetric {a — (3 — 1) 
single- modification network with moderate diffusion (S = l/Air) and zero-order sensitivity (7 = e — 0.1); here 
(p = [S*]/[S]t- (B) a deactivation-biased {a = 5, f3 ~ 1) double-modification network with high diffusion 
{S = 10) and linear sensitivity (7"^ — 0.05, 67^^ ~ 0.01); here (j) = [S'**]/[S']t- It is seen that in the clustered 
configuration, the response is reduced with N for the single-modification network, and enhanced with N for 
the double-modification network. 



enzymes. As seen in Fig. [2j3, the double- modification network can avoid this reduction and can in fact 
achieve an amplification beyond the well-mixed response instead. 

Clustering reduces the effective target size 

How does clustering reduce the response of a single-modification network? The key is that a cluster presents 
a smaller effective target to a molecule in the bulk. To understand this fact, one may consider that each 
molecule possesses a reaction volume — the volume that must be entered by the center of mass of a second 
molecule in order for an association reaction to take place (Fig.jsj'V). When the Ea molecules are arranged in 
a random configuration at low enough density, the total reaction volume (or target size) is simply N times 
an individual Ea molecule's reaction volume. However, when the Ea molecules are clustered, the individual 
reaction volumes overlap, and the total target size is reduced (Fig. [Sj'V). 

To understand quantitatively the impact of the target size reduction on the response of the network, we 
consider the time it takes an S molecule, released from the bulk, to bind an Ea molecule on the membrane 
(the lifetime of the S molecule). If the Ea molecules are free with high probability (i.e. unoccupied by other 
substrate molecules) the lifetime is dominated by the search time s, the time to find and bind an Ea molecule. 
The mean search time from the bulk can be estimated as the inverse of the association rate over the volume 
of the box: s « (fci/L^Z)^^ = L^ZS/£D. A random distribution of Ea molecules presents N targets of 
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Figure 3: The target size effect. (A) Cartoon depicting the reaction volume boundaries (black lines) of 
activating enzyme molecules (green circles) in a random (left) and clustered (right) configuration. The total 
reaction volume is smaller in the clustered configuration due to the overlap of individual molecules' volumes. 
(B) Main plot shows the mean lifetime of S molecules in the single-modification network; parameters are as 
in Fig. [2|\. Inset demonstrates that the value to which the mean lifetime asymptotes at high input x in the 
clustered configuration — the mean search time a'^ — grows as the square root of the cluster size N. 



diameter i, which reduces the mean search time by a factor of N: s'' = s/N — L'^ZS/NtD. A cluster, 
on the other hand, presents one target with effective diameter i^s = V^£, making the mean search time 
s'^ — L'^ Z6/{-\/N£)D. In terms of the dimensionless parameters, these times read a'' = /{f' /D) — (S/ji 
and a'^ = s'^/{i'^/D) — ^/N(S/fj,, which makes clear that at constant surface density the search time is 
independent of N for the random configuration but scales with N^^^ for the clustered configuration. 

Two important conditions of the above analysis are that the Ea molecules are free and that the S 
molecule is released randomly from the bulk (and not, say, still within the neighborhood of the cluster). 
The first condition is met at high input (x = k^/kg 3> 1), when the high catalytic rate of the activating 
enzymes leaves the Ea molecules free with high probability. The second condition is also met at high 
input for networks with zero-order sensitivity, in which saturation of the deactivating enzymes leaves the 
Ed molecules occupied with high probability. High occupation of E^ molecules means that a typical S* 
molecule has ample time to randomize its position before ultimately binding a free E^ molecule and being 
released as an S molecule. Thus, for S* molecules the arrangement of the Ea molecules is forgotten, while 
for S molecules the arrangement is critical. Indeed, at high input, clustering leads to substrate molecules 
spending more time in the inactive state, and thus to a reduced output (Fig.[2]A_). 

Fig. [3J3 shows the mean lifetime of S molecules as a function of the input x for a single-modification 
network with zero-order sensitivity. At high input, the mean lifetime asymptotes to the value corresponding 
to the search from the bulk, consistent with the above analysis. It is clear that for the clustered configuration, 
this asymptotic value depends on the cluster size N, and the inset shows that it indeed scales with N^^"^, as 
predicted. 

It is important to emphasize that the target size effect is a bulk effect, not a local effect, in the sense 
that clustering does not only reduce the number of neighboring sites from which a substrate molecule can 
bind, but more generally reduces the number of distinct paths which lead to the target from a point in the 
bulk. This intuition is confirmed by a simple test: under an alternative implementation, in which a substrate 
molecule can only bind an Ea molecule from the neighboring lattice site perpendicular to the membrane, we 
observe an increase in the search time with cluster size N that is only slightly less pronounced than that in 
the inset of Fig. [3|3 (not shown). Because this alternative implementation has the property that clustering 
the Ea molecules does not change the number of available neighboring sites, the increase in search time with 
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N is strictly due to a reduction in the number of paths from which the target is accessible. 

It is also important to point out that the target size effect is a generic property of diffusive random 
walks, and as such it is just as present for double-modification networks as it is for single-modification 
networks. However, as we will describe next, in a particular parameter regime the effects of rapid rebinding 
can overcome the target size effect, leading to an enhancement of the response rather than a reduction. 

Finally, we note that in networks with linear sensitivity, Ed molecules remain free, even at high input 
( Appendix |b]) . This freedom violates one of the assumptions of the simple scaling analysis above. Nonetheless, 
we find that the reduction with clustering in the response of single-modification networks persists. Next we 
discuss the role of rebinding in double-modification networks, which will ultimately also give insight into this 
persistence for single-modification networks. 



Clustering promotes rapid rebinding 

How does clustering enhance the response of a double-modification network? The key is that a cluster 
promotes rapid rebinding of singly activated substrate molecules. Rebinding only occurs in the double- 
modification network; in the single-modification network, once released by an enzyme, a substrate molecule 
can only bind to an enzyme of the opposite type. To clearly understand the rapid rebinding effect in double- 
modification networks, we first consider the distribution of rebinding times for a reduced system: a single 
S* molecule is released from one of N Ea molecules, with no Ed molecules present. It rebinds to any Ea 
molecule in a time r, whose dimensionless analog we define as p = rj^P^jU). 

As seen in Fig. [4j for both a random and clustered configuration of Ea molecules, the rebinding time 
distribution contains three regimes. Short times (the moZecM/ar regime) correspond to short excursions, after 
which the S* molecule rebinds to the same E^ molecule (or cluster) from which it came. Intermediate times 
(the planar regime) correspond to excursions that are sufficiently far for the S* to "see" the membrane as a 
plane uniformly populated with Ea molecules (or, due to the periodicity, with clusters), yet not far enough 
to see the reflective boundary; the granularity of individual Ea molecules is thus lost, and the membrane 
appears as a uniform semi-absorbent plane. Long times (the bulk regime) correspond to long excursions, 
during which the S* molecule randomizes its position completely, returning as if from the bulk. 



Bulk regime The bulk regime exhibits an exponential distribution because it describes the time to find 
an Ea molecule given a random starting position in the bulk. An exponential distribution is expected from a 
well-mixed system, in which reactions obey exponential waiting time statistics. Here, however, the molecular 
and planar regimes emerge due entirely to spatial correlations, affecting the network even at the level of the 
mean response (Fig. [2j3). Moreover, in the bulk regime, the substrate molecule has strayed far enough from 
the membrane that it effectively returns from the bulk; this return time is therefore equivalent to the search 
time defined above for S molecules in single-modification networks. Accordingly, one notices in Fig. |4] that 
the time constant characterizing the exponential is larger in the clustered case, precisely due to the target 
size effect previously discussed. Finally, the onset of the bulk regime is determined by the time it takes the 
S* molecule to randomize its position, which is approximately the time to diffuse the full cytoplasmic depth: 
rpb = Zy2D, or ppt = (^2. 



Planar regime In the planar regime, the substrate molecule has diffused not far enough to enter the bulk 
so that it loses memory of its starting position, but far enough that the membrane appears as a uniform semi- 
absorbent plane. The problem can be reduced to an effectively one-dimensional one in the z direction with a 
radiation boundary at z = 0. The one-dimensional rate fcoff (with dimensions of length per time) describing 
association at the boundary follows from a renormalization of the three-dimensional rate fci; clearly, fcog 
should scale with the surface density N/L'^, and we find good agreement with the simplest dimensionally 
consistent definition, keg = kiN / . 

As shown in Appendix [C] the rebinding time distribution for this one-dimensional problem is readily 
obtained from the Green's function and exhibits scalings of at short times, p~^/'^ at long times, 

and a crossover time of pp = {D/lkcsY — ^'^ 1 1^ ■ Short times comprise a collision-dominated subregime, 
in which the excursion is dominated by many unsuccessful refiections, and thus inherits the t~^l'^ scaling 
from the Gaussian Green's function of a particle freely diffusing in one dimension. Long times comprise a 
search-dominated subregime, in which after a long excursion the particle returns to an effectively absorbing 
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Figure 4: Rebinding time distributions. Distribution of times p for a single S* molecule, released from an 
Ea molecule, to rebind to an Ea molecule, when the Ea molecules are either randomly distributed (solid) or 
clustered (dash-dot) on the membrane; no Ed molecules are present. Short straight lines scale with p~^l'^ 
(shallow) and p^^l"^ (steep). To elucidate scalings, high values are taken for the dimensionless parameters 
describing cluster size {N — 100), diffusion (8 = 1.6), inverse surface density (/i~^ = 200), and cytoplasmic 
depth (C — 2000). For each of the two configurations, there emerge three regimes distinguishable by scalings. 
Times separating the regimes, as well as times marking scaling crossovers within regimes, are derived in 
the text and indicated on the figure: pj^p and pjjjp separate the molecular from the planar regime in the 
random and clustered configurations, respectively; pbp separates the planar from the bulk regime in both 
configurations; pjjj marks the scaling crossover within the molecular regime in the clustered configuration; 
and pp marks the scaling crossover within the planar regime in both configurations. 



boundary, producing the f^^"^ scaling characteristic of a one-dimensional random walker returning to an 
absorbing origin. Further detail is provided in Appendix [C| 

The transition between the molecular and planar regimes occurs when the S* molecule diffuses far 
enough perpendicular to the membrane that it no longer detects the granularity of the Ea molecules, a 
distance roughly equal to half the mean spacing between E^ molecules in the random configuration, or 
between clusters in the clustered configuration. In the random configuration, the mean spacing between 
Ea molecules is set by the surface density, yielding a separating time of r\-^^ = [-^Z (L'^/N)/2]'^ /2D, or 
pI-^p = 1/8/i. In the clustered configuration, the spacing between clusters is L, yielding a separating time of 
rS,p = {LI2Y/2D, or pl,^ = N/8f,. 

Molecular regime The molecular regime is defined by short excursions, in which the substrate molecule 
rebinds to the Ea molecule or cluster from which it came. The molecular regime exhibits p~^/'^ and p~^l^ 
scalings whose origins are the same as those in the planar regime: the scalings arise from a collision-dominated 
or search-dominated return, respectively, to a single molecule or cluster. For a return to single molecule, 
which applies to the random configuration, these scalings were described in previous work p]. The crossover 
time was derived to be 

" (1 + kl^Mf ' 

where here fc = 2fci, the factor of two arising from reflection of the Ea across the membrane. One sees 
from Fig. [4j however, that in the random configuration the crossover time is obscured by alternations in the 
probability density at short times, which is an artifact of the lattice implementation. To be precise: an S* 
molecule starting next to an Ea molecule can only rebind in an odd number of time steps (assuming it moves 
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diffusively every time step, which is true at short times for large S); the exception occurs when another Ea 
molecule is placed next to or very near the first Ea molecule, but at low surface densities such a placement 
occurs with low probability. We have validated the distributions in Fig. |4] using Green's Function Reaction 
Dynamics [5], verifying that lattice artifacts do not quantitatively change the probability densities. 

In the clustered configuration, the crossover time within the molecular regime is indeed resolvable and 
can be described in terms of the previously considered results. A large, absorbent cluster {N ^1,5^1) 
can be approximated as a plane with an effective one-dimensional association rate kcs = ki/l"^, yielding a 
dimensionless crossover time of {D/£kcs)'^ = 5^ . In the opposite limit, a small, reflective cluster [N ^ 1, 
(5^1) can be approximated as a spherical object whose effective diameter is obtained by equating surface 
areas: 47r(£eff/2)^ — 2N£'^ (neglecting cluster edges). In the hmit of large S the denominator in Eqn. [s] 
approaches unity, making the crossover time approximately i'^g/D, or iV/27r in dimensionless units. Since 
the expressions in both the plane and sphere limits scale with parameters that are large in the opposite 
limits, we use the minimmn as an estimate of the crossover time: pjjj ~ min((5^, N/2tt). 

Figure |4] corroborates all scalings and crossover times derived above using an illustrative set of sample 
parameters. Because we have analytic estimates for the crossover times, they can be tuned to expand or 
contract the various regimes, a fact we have used to confirm the validity of the scalings beyond the confidence 
implied by Fig. |4] alone. 

Figure |4] also directly displays the advantage that clustering affords in the rebinding problem: at short 
times, the probability of rebinding is enhanced, leading to a probability gap over the random configuration. 
In fact, the characteristic time that determines the extent of this gap, p'^, reveals the parameter regimes 
that give rise to enhanced rebinding, and thus ultimately to an enhanced signal output for the network. 
Specifically, the gap increases as p'^ increases, both via increasing the cluster size N and increasing diffusion 
relative to association, 6. Increasing the cluster size is a straightforward way of enhancing rebinding, and 
the associated enhancement of the output is demonstrated in Fig. [2j3. 

The reason that increasing diffusion increases the probability gap is perhaps less straightforward but can 
be understood at the molecular level. High diffusion induces many unsuccessful collisions before eventual 
rebinding. Rapidly rebinding to a single Ea molecule (which is the task when the Ea molecules are randomly 
distributed) is is therefore unlikely. Rapidly rebinding to a cluster, on the other hand, is less unlikely, owing 
to the presence of neighbors. The number of collisions in the neighborhood of a cluster is simply larger than 
that for a single molecule by virtue of its increased size. The probability of ultimately achieving a successful 
collision is thus higher for the clustered configuration than for the random configuration, by a factor that 
increases as the system is placed more strongly in the diffusive, or collision-dominated, regime. 

At a more detailed mechanistic level, we may consider the fate of an S* molecule that has just been 
released by an Ea molecule, and now resides at a neighboring lattice point. At high diffusion, the probability 
is large that the S* molecule will take a step away from the Ea molecule. In the random configuration, it 
is then increasingly likely for diffusion to carry the S* molecule away from the immediate vicinity of the 
Ea molecule. In the clustered configuration, on the other hand, several of these diffusive paths will lead 
directly to another Ea molecule. Clustering therefore poses an advantage when high diffusion ensures that 
immediate rebinding is unlikely, but rebinding after several diffusive steps is more probable. 

Of course, the advantage afforded by clustering cannot persist in the infinite diffusion (well-mixed) limit 
— in this limit, all spatial information is lost. The consequences of this fact for the network output are 
discussed in more detail later in this section. 

Deactivation connects rebinding to the network response 

Interestingly, despite the probability gap elucidated above, we observe that the means of the two rebinding 
distributions in Fig.[4]are the same: the enhancement conferred to the clustered configuration in the molecular 
regime is compensated by the target size effect in the bulk regime. The equivalence of means is a consequence 
of the fact that we have isolated the rebinding process. Alone, the rebinding process is equivalent to one 
dissociation and subsequent association event of the equilibrium reaction A + B ^ C. For equilibrium 
reactions, detailed balance ensures that mean quantities arc unaffected by spatial arrangement. Therefore, 
although the push-pull network as a whole prescribes a non-equilibrium process, the rebinding process 
alone is effectively in equilibrium, and the mean rebinding time is the same for a random and a clustered 
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ratio of deactivating to activating enzymes, q = [Ed]T /[Eajr 



Figure 5: The capture effect. (A) As deactivating enzymes are introduced, increasing the enzyme ratio a, the 
mean rebinding time p decreases more quickly for a clustered configuration than for a random configuration. 
Here /z = 0.01, C = 25, TV = 25, and 6 = 10. (B) Correspondingly, in a double-modification network, the 
maximal output 0max for the clustered configuration is higher than that for the random configuration at 
large a. Solid line shows the analytic prediction in the well-mixed limit, from which the data deviate due 
to spatial effects. Inset shows same data normalized by the well-mixed prediction (f>max- Sensitivity is linear 
(7~^ = 0.05, £7^^ = 0.01) with /3 ~ 1 and other parameters as in A. 



configuration. 

How then does the probability gap translate to an enhancement with clustering at the level of the mean 
response, as in Fig. [2f3? Indeed, it is precisely because thus far in the discussion we have not reintroduced 
the Ed molecules. The effect of the Ed molecules is to bind and deactivate the S* molecules with the 
longest excursion times, removing them from the rebinding problem and thereby truncating the rebinding 
distributions beyond a characteristic timescale, which we call the capture time. The truncation alleviates the 
target size effect, imparting the clustered configuration with a shorter mean rebinding time than the random 
configuration. 

The capture effect is illustrated in Fig. [Sj'V, in which Ed molecules are gradually reintroduced into the 
system. With Ed molecules present, an S* molecule has two fates /: it may rebind to an Ea molecule 
(/ = +) or be captured by an Ed molecule (/ — — ); measuring the time r for either fate samples the 
joint distribution p{T,f). The mean rebinding time p is then computed from the conditional distribution 
p{t\+) = p{t,+)/p{+), where p{+) is the total probability of rebinding as opposed to capture. Figure 
[5]^ shows that the difference in mean rebinding times between the random and clustered configurations, 
Ap = p^ — p'^ , indeed increases as the ratio a of Ed to Ea molecules is increased. The increase abates at large 
a, when the truncation at the capture time dominates the rebinding distribution, such that p approaches the 
capture time. The capture time can be estimated as the inverse of the association rate times the concentration 
of Ed molecules, {ki[Ed]T)~^ , or S(^/af3p, in dimensionless units; accordingly, Fig. [sJA demonstrates that p 
scales with at large a. 

The result of the capture effect for the double- modification network is illustrated in Fig. [5j3, which 
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shows the maximal output (pmax as a function of a for a network with hnear sensitivity. One observes that 
the clustered configuration produces a larger (jjmax than the random configuration beyond an enzyme ratio 
of a* 1.5 (see inset), indicating that clustering enhances the output when there are more deactivating 
enzymes than activating enzymes, such that the capture effect is strong. Indeed, at a* the difference in 
mean rebinding times (Ap ~ 9 x 10^) sufficiently surpasses the corresponding difference in mean search 
times (A(7 = a'^ — a'^ 3 x 10^) — that is, the capture effect overtakes the target size effect. 

Lastly, we point out that the rebinding distributions in Fig. |4j although strictly a component of double- 
modification networks, also lend intuition to the analysis of single-modification networks. Although rebinding 
does not occur directly in single- modification networks, the process of an S* being released from an Ea, 
binding an E^i, being released as an S, and returning to an Ea can be thought of as an effective rebinding 
excursion. The fact that deactivation must occur during this excursion imposes a minimum rebinding time, 
effectively introducing a truncation of the rebinding distributions at short times, thereby imparting the 
clustered configuration with a longer mean rebinding time than the random configuration. This result helps 
explain why the reduction in output seen with clustering in single-modification networks (Fig. [2j'V) persists 
even in networks with linear sensitivity: since the E^ molecules are free with high probability even at 
high input, the mean deactivation time is the same for both configurations; the extra time in the clustered 
configuration must therefore be spent while the substrate is inactive, leading to a lower output on average. 

Clustering benefits double-modification networks with linear sensitivity 

Figure [5J3 clearly illustrates a trade-off: at high a, clustering enhances the network output beyond that of the 
random configuration, but increasing a reduces the maximal output in general. The reduction with a can be 
derived in the deterministic, well-mixed limit (Appendix [b|; the result, (^max = {l + af3 + a'^/3'^)~^, is overlaid 
in Figure and provides a baseline from which the data diverge due to spatial effects. The intuition behind 
the reduction is straightforward: biasing the network toward deactivation reduces the fraction of active 
substrate. The reduction, however, is unique to networks with linear sensitivity, which leads to the question: 
can clustering enhance the network response in the zero-order regime, in which the maximal output remains 
high? 

The answer, revealed in Fig. [6]A, is no: as the sensitivity is varied from linear to zero-order, the en- 
hancement with clustering vanishes, then reverses. The reason is that the capture effect, which underlies the 
enhancement with clustering, relies on the Ed molecules being free. The zero-order regime corresponds to 
saturation of the enzymes by the substrate, such that at high input the E^ molecules are not free, but rather 
they are occupied with high probability. The mean capture time is then exceedingly long, such that the 
mean rebinding time is independent of the configuration of Ea moelcules. The target size effect takes over, 
and the random configuration produces a higher output. The end result is that the benefit of clustering is 
specific to double-modification networks with linear sensitivity; in ultrasensitive networks the benefit is lost. 

Clustering leads to an optimal diffusion coefficient 

In discussing the rebinding distributions, it was discovered that increasing diffusion enhances rapid rebind- 
ing to a cluster more strongly than to a random configuration, because it places the system in a more 
collision-dominated regime. One then expects the enhancement with clustering to increase with the diffusion 
coefficient. However, we also know that at very high diffusion, the network is well-mixed, and the spatial 
arrangement of the molecules is irrelevant; clustering should therefore confer no enhancement at very high 
diffusion. In fact, these competing effects lead to a diffusion coefficient at which the enhancement is optimal, 
as shown in Fig. [6j3. 

Figure [6|3 illustrates that as the ratio S of diffusion to association is increased, the enhancement, i.e. the 
difference in maximal output between the clustered and the random configuration A0,„ax = ^max ~ '^maxi 
first increases then decreases. The inset shows this nonmonotonic behavior for several values of the cluster 
size N. The optimal enhancement increases with N; moreover, the value S* at which the optimum occurs 
also increases with N. These observations are consistent with the notion that a larger cluster can confer an 
advantage more effectively in a highly diffusive regime. 

Quantitatively, S* approaches ~10 for the largest cluster size {N = 100), corresponding to an association 
rate ki ^ £D/10, roughly 10 • 47r 120 times less than the diffusion-limited value. Optimal enhancement 
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Figure 6: The effects of varying sensitivity and diffusion on tire response of a double-modification network. 
Both panels take N = 25, a = 5, (3 = 1, fi = 0.01, and C = 25. (A) The input-output response of a network 
with high diffusion {S = 10) as sensitivity is varied from linear (7"^ = 0.05, £7"^ = 0.01) to intermediate 
(7 = 0.2, e — 0.1) to zero-order (7 = 0.05, e = 0.05). (B) The input-output response of a linear network 
(7"^ = 0.05, £7^^ = 0.01) as diffusion is varied. The inset shows the normalized difference between the 
maximal output in the clustered and random configurations (the enhancement) versus 6 for several values of 
the cluster size N. 



therefore occurs in a regime in which unsuccessful collisions are very frequent because association is very 
rare. This result suggests a possible function of clustering in a double-modification system, in terms of 
compensating for weak association rates of individual enzymes: by promoting rapid rebinding events, a 
cluster enhances the total probability of associating. This function may have evolved as a way for a cell to 
boost the response when intrinsic association rates are very low. 



Discussion 

We provide a detailed view of the varied effects that membrane clustering can have on the signaling prop- 
erties of a canonical biochemical network. The network under study and the values of relevant biophysical 
parameters are drawn from experimentally studied systems, both prokaryotic and eukaryotic, in which mem- 
brane clustering has been recently observed. We implement a spatially resolved model, appealing to both 
simulation and analytic results to demonstrate that spatial correlations can have nontrivial effects, even at 
the level of the mean input-output response. In particular, we show that spatial effects at both the bulk scale 
— in terms of a diffusive target search process — and at the molecular scale — in terms of rapid stochastic 
rebinding events — affect the response of a network in ways that are not captured by a well-mixed, spatially 
uniform description. 

Our results make clear that the effect of clustering depends on both the network topology and the 
biochemical parameters. For example, we identify a general property of diffusive random walks — that clus- 
tering the target increases the search time from the bulk — which leads generically to a reduced response in a 
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single-modification network. However, when the topology of the network is extended to double-modification, 
the reduction can be overcome by a local effect: clustering promotes rapid rebinding of singly active sub- 
strate molecules to the activating enzyme molecules. When the concentration of free deactivating enzyme 
molecules is sufficiently high to isolate these rapid rebinding events, the result is an enhancement of the 
response. Importantly, this enhancement is specific to networks with linear sensitivity; ultrasensitive net- 
works, in which the deactivating enzyme molecules are saturated by the substrate at high input, do not 
confer the enhancement because the mechanism relies on the deactivating enzyme molecules being free. 
Moreover, the enhancement is most pronounced in a highly diffusive regime, in which unsuccessful collisions 
dominate, making the probability to escape a single activating enzyme molecule much higher than that to 
escape a cluster. The specificity of the enhancement to both linear sensitivity and high diffusion highlights 
the importance of biochemical parameters in predicting the effects of clustering. 

The result that clustering is most beneficial in a highly diffusive regime has important functional implica- 
tions. We find that the diffusion coefficient at which the cluster-induced enhancement is optimal corresponds 
to an intrinsic association rate roughly 100 times smaller than its diffusion-limited value. This finding implies 
that clustering is most helpful for signaling when intrinsic association rates are very low. Mechanistically, 
such a result is sensible, since the presence of clustered neighbors can boost the overall probability of re- 
binding, even when the probability of rebinding to a single molecule is negligible. Functionally, this result 
suggests that membrane clustering may have evolved as a way to boost signal output despite low intrinsic 
association rates. Indeed, such a result could be tested experimentally: our study predicts that enzymes 
that are involved in clustered signaling complexes are likely to have intrinsic association rates that are lower 
than the diffusion-limited value, placing the system in a collision-dominated regime. It would be interesting 
to test this prediction with a controlled assay of binding affinites or a curation of binding rates for proteins 
that are known to cluster. 

Our findings emphasize the important role of the rebinding process in biochemical signaling. The im- 
portance of rebinding has been discussed in related systems, with interesting consequences for the mean 
response. For example, in studying how the diffusive motion of a repressor protein effects gene expression, 
it has been observed that the mean response remains describable by a well-mixed theory, albeit with param- 
eters that are appropriately rescaled to account for enhanced noise [SU]. In studying signaling via a MAPK 
cascade, on the other hand, it has been found that spatial correlations due to rapid rebinding introduce 
qualitative changes in the mean response that cannot be captured by the well- mixed theory [6] . Our results 
here are more resonant with the second case, since it is clear that membrane localization and subsequent 
clustering introduce changes to the rebinding statistics (Fig. ^ that go beyond the exponential distributions 
expected from a well-mixed description. More broadly, the importance of rebinding has been recognized 
in explaining the potency of T cell ligand binding, in which a long aggregated binding time arises from a 
sequence of many fast rebinding events [31] . 

This study represents a first step in using simulation and analytic techniques to understand the role of 
spatial organization in signaling. It is our view that spatially resolved models, as well as a sharp theoretical 
framework, can help formalize and make more quantitative the inferences that are being made from the 
wealth of experimental data on systems which exhibit clustering, colocalization, and other nontrivial spatial 
heterogeneity. 
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A Reaction-diffusion implementation on the lattice 

In this appendix, we describe how reactions and diffusion are implemented for particles on the lattice. 
In particular, the implementation obeys detailed balance and ensures that the deterministic results (next 
section) are recovered by spatially averaged quantities in the high-diffusion limit. 
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We remind the reader that we consider a regular three-dimensional lattice with excluded volume interac- 
tions. We make the approximation that all molecules have equal diameter i, and we let this diameter define 
the lattice spacing, such that molecules neighboring each other on the lattice are in contact. In the clustered 
configuration, N molecules are placed in contact in a square arrangement on the membrane. The membrane 
comprises the x-y plane and extends for a length L in each direction, beyond which periodic boundaries are 
imposed. The cytoplasm has depth Z, with reflective boundaries at both the membrane (z = 0) and the 
farthest point from it {z = Z). Reflection at z = implies that cytoplasmic molecules do not bind directly 
to the membrane, but rather only bind to the cytoplasmic domain of membrane-bound molecules. 

Particle numbers used in the simulation box of volume L^Z are expressible in terms of the dimensionless 
parameters in Eqns.[3]|4] the number of activating enzyme molecules is N, the number of deactivating enzyme 
molecules is aN, and the number of substrate molecules is N/e. 

Over timescales longer than the time to diffuse a few molecular diameters, rotational diffusion sufficiently 
randomizes a molecule's orientation. Thus although we imagine each deactivating enzyme as possessing a 
catalytic domain to which the substrate binds, we model its reaction propensities as isotropically distributed 
over its surface. Since the activating enzymes, on the other hand, are membrane-bound, the situation is 
more subtle: we suppose that the cytoplasmic domain of each activating enzyme molecule traces out the 
hemispherical solid angle inside the membrane, except when blocked by neighboring activating enzymes (a 
consideration particularly relevant when the activating enzymes are clustered). Neighbors thus have the 
effect of reducing the molecule's cross-section: the reaction propensity of each activating enzyme molecule 
is distributed over the portion of its surface both inside the membrane and unblocked by neighbors. 

We note that the system as described can be mapped to a statistically equivalent system with periodic 
boundaries in the z direction, which offers both simpler implementation and, in some cases, more direct 
analytic interpretation. Specifically, the reflective boundaries at z = and Z are equivalent to a periodic 
boundary at z = and 2Z, so long as we recognize that the cytoplasmic molecules then double in number and 
the membrane-bound molecules (having been reflected across the membrane) become twice as reactive. The 
periodic boundaries confer the advantage that the membrane no longer needs to be explicitly implemented 
in the simulation: cytoplasmic molecules can occupy the plane in which the activating enzyme molecules 
reside, and substrate molecules can bind to activating enzyme molecules from any free neighboring site. 

We describe the implementation of reactions and diffusion on the lattice using a simple example, then 
extend to the push-pull reactions (Eqns. [l][2]). We consider the binary reversible reaction A + B ^ C , in 
which association and dissociation are described by intrinsic rates ka (with dimensions of length cubed per 
time) and kd (with dimensions of inverse time) , respectively. Dissociation is modeled as a first-order reaction 
event with an exponential waiting time distribution; the probability for a C molecule to dissociate in a time 
step dt is thus pd « kddt for small pd- Association is set by detailed balance, which equates the ratio of 
microscopic probabilities to enter and leave a reaction state to the ratio of macroscopic rates |32j : on a 
lattice with spacing £, the detailed balance condition reads Pa^^/Pd = ka/kd, yielding pa — {ka/£^)dt for the 
association probability of an A and B molecule at contact. Finally, diffusion is implemented according to its 
microscopic definition, namely that the mean squared distance traveled in a time dt is 6Ddt; the six possible 
moves on the lattice result a mean squared distance of &pd^^ in one time step, making pn ~ {D/P)dt the 
probability for a molecule to diffuse to a neighboring site. 

We now extend the above expressions to the push-pull reactions and write them in terms of the di- 
mensionless parameters (Eqns. 3]|4), the cluster size TV, the input x = fca/fcg, and the dimensionless time 
T = t/{P /D). The probability to diffuse to a neighboring site in a time step dt is po = {D/£'^)dt = dr. 
There are two association reactions, with probabilities of occurring from contact pi = {ki/i^)dt — {l/S)dT 
(activation) and p4 ~ {k4^/£^)dt = {/3/S)dT (deactivation). Lastly, there are two dissociation reactions, 
with probabilities of occuring p^ = k^dt ~ {xP^ / 5eQdT (activation) and pq = k^dt = (l3jp,/SeC)dT (de- 
activation), where in addition to using the definitions of the dimensionless parameters, we have recognized 
explicitly that [Ealr = N/Li^Z . The time step dr is chosen small enough that the sum of each molecule's 
diffusion and reaction probabilities is bounded from above by one at all times. 

Both association and dissociation probabilities are divided uniformly over the faces of each molecule (or, 
in the case of membrane-bound molecules, the faces not blocked by fixed neighbors). Furthermore, the 
choice of which molecule actually moves during a dissociation event is determined by diffusion: in the binary 
reaction, A would move with probability Da/{Da-\-Db) and B would move with probability Db/{Da + Db). 
In practice, then, when a substrate molecule dissociates from an activating enzyme molecule, the substrate 
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molecule always moves, because the activating enzymes are fixed. When a substrate molecule dissociates from 
a deactivating enzyme molecule, on the other hand, each molecule moves with probability 1/2, because the 
diffusion coefficients are equal. These choices ensure that the total probability of associating or dissociating 
in each time step sums to Pa or p^;, respectively, and therefore that detailed balance is obeyed. 



B Analytic results in the deterministic, well-mixed limit 

In this appendix, we derive key analytic results for both the single- and double-modification push-pull 
network in the deterministic, well-mixed limit (i.e. invoking rate equations). Strictly speaking, these results 
are exact for averaged quantities in the limit of infinite diffusion. More broadly, however, they lend powerful 
intuition to the spatially resolved results, even when diffusion plays a significant role. 



B.l The single-modification network 

We begin with the single- modification network (Eqns. l][2), which in steady state is described by the rate 
equations 

= ^ = -h[E,][S]+ke[EdS*], (6) 
at 

= ^ = -k4E,][S*] + h[EaSl (7) 
at 

= ^^^^^^^k,[E^][S]+ks[E^S], (8) 

= '^^~'^^~k4E,][S*]+UE,S*]. (9) 

Here, as in the main text, we neglect back reactions (^2 = ^5 = 0). The rate equations are complemented 
by the conservation laws 

[EaW = [Ea] + [EaSi (10) 

[EdW = [Ed] + [EdS*], (11) 
[S]t = [S] + [S*] + [EaS] + [EdS*]. (12) 

As implied by the conservation laws, the rate equations contain three redundancies from the zero net flux of 
activating enzyme, deactivating enzyme, and substrate; two are made explicit in Eqns. [8]|9j and the third is 
revealed by the fact that the sum of Eqns. |6][7] equals the sum of Eqns. [8]|9] Eqns. pp2 thus constitute six 
independent equations for six unknowns. Scaling concentrations by the Michaclis-Mcnten concentration of 
the deactivation process, K = ke/k^, 

_[Ea] _[S] _[Ed] _[EaS] _ [EdS*] 

Xl = — , 2:2 = —, ^3 = -^, 0:4=-^, X5 = — (13) 

and recalling the definitions of the dimensionless parameters introduced in Eqn. |3j 

[Ea]T ^" fci' ^" [S]t' [S]t ' ^ ' 

the six independent equations may be written 

X1X2 = I3x5, (15) 

X'^x^cj) = 7x4, (16) 

xscj) = 7x5, (17) 

e = 7(a;i-fa:4), (18) 

ae = j{x3 + X5), (19) 

1 = (l3 + j{x2+Xi+X5), (20) 
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where x = ks/kg and (p — [S*]/[S]t are the input and output, respectively. 
Combining Eqns. T5]|20 yields a third-degree polynomial equation for tf) 



= 



Xix - o)<f>^ + + X)(x - a)e + x(2x + (^Px ~ a)7 - xix 
+X [a(l + X)e + X(l + a/3)7 - (2% - a)] -fc^ - x't'- 



(21) 



In principle, Eqn. 21 can be solved for the input-output relation 4'{x)- However, the solution to such a cubic 
equation is quite unwieldy, and we therefore focus on limits of Eqn. 21 (or the original Eqns. 15|[20 ) at high 
input (x ^ 1), and in the zero-order ({£,7} <C 1) and linear regimes ({??,i^} ^ 1). Here, for notational 
convenience, we have defined 

7 A -f K 

We find the maximal output value, (f){x ^ 1) 
is obtained when x^^ — exactly; by Eqn. 
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nax, directly from Eqns. TSpO The leading order result 
this leads to 0:4 = [EaS] = 0, which makes sense because 
at infinite catalytic rate the complex EaS has zero lifetime. Combining the five remaining equations 
yields a quadratic equation. 



= 0,„ax + [ae + (1 + a/3)7 - 



■7, 



(23) 



whose solution directly gives ^max- 

In the zero-order regime, to zeroth order in the small parameters (7 = e = 0), Eqn. 23 reads = 
0max(0max — 1), giving the maximal output value (/>niax = 1: it is possible to activate all substrate molecules 
at high input. Further insight is revealed by Eqn. |21[ which for j — e = reduces to 



= (x-a) 



!)• 



Here, when x 7^ a, (/> must be either or 1, implying switch- like behavior around the threshold x* 
switch-like behavior is the hallmark of zero-order sensitivity . 

In the linear regime, we obtain (/)max by rewriting Eqn. [23] in terms of 77 and u: 



= '?'/'max + [ai^ + {1 + af3) - T]](p„ 



1. 



To zeroth order in the small parameters (77 = = 0), Eqn. 25 gives 

1 



(24) 
a. This 

(25) 
(26) 



1 + ap 

Interestingly, for a symmetric network (/? = a = 1), we see that in the linear regime it is only possible to 
activate half the substrate molecules at high input. We also obtain the threshold value from Eqn. 21 which 
in terms of r/ and reads 



= x(x -a)ri (j) +[a{l + x)ix~a)iy + xi^x + aPx -a)- x(x J?*/* 
+X [a(l + x)v + X(l + a/3) - (2x - a)r/] - x'- 

Taking Eqn. [27] to first order in rj and v yields 

= ?7(2x + a(3x - a)4P + [a{l + x)^ + x(l + - (2x " aH <P - X, 
into which we insert (p = 0max/2 = 1/[2(1 + a/3)] and solve for x, yielding the threshold value 



X 



a{l + 2al3)r) + 2a(l + al3)v 
2(1 -I- a/3)2 + (1 3a/3)r/ - 2a(l -f al3)v 



a(l-f2a/3) 



2{l + apf 



(27) 



(28) 



(29) 



We see that while in the zero-order regime the threshold is set by the ratio of activating to deactivating 
enzymes, a, in the linear regime the threshold vanishes in proportion to the small parameters that define 
the regime, 77 and v. We find that Eqn. 29 also serves as a good estimate for the threshold in a double- 
modification network with linear sensitivity, and thus explains why the threshold shifts to small x in Fig- 
[UA. of the main text. 
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B.2 The double-modification network 



We now consider the double-modification network, which prescribes additional reactions identical to Eqns. 
[T][2j except with S and S* replaced by S* and S** , respectively. As in the main text, we restrict our analysis 
to networks whose first and second modification processes are identical (i.e. the rates fci, fc2, ■ ■ ■ ,kQ describing 
the first modification also describe the second). There are now nine species, described by the dimensionless 



variables in Eqn. 13 the new variables 



K 



X7 



_ [ EgS*] 

K 



Xs 



_ [ EdS**] 
K 



(30) 



and the redefined output (j) = \S**\I\S\t- As before the nine rate equations contain three redundancies 
from the zero net fiux of activating enzyme, deactivating enzyme, and substrate; together with the three 
conservation laws we thus have nine independent equations for nine unknowns: 



X\X-2. 


= /3a;5, 


(31) 




= a;4, 


(32) 




= a;5, 


(33) 


X\Xq 




(34) 




= 72^7, 


(35) 




= 72^8, 


(36) 


e 


7(xi + X4 + X7), 


(37) 


ae 


= 7(2:3 + X5 + Xs), 


(38) 


1 


= + 7(2:2+2:4 + 2:5 + 2:6 + 2:7 + 2:8). 


(39) 



Although it is no longer straightforward to combine Eqns. 3T]|39 into a single polynomial in 0, results in 
certain limits can be obtained directly from the equations themselves. For example, we may immediately 
seek the maximal output value 0max by taking the limit x ^ 1 to zeroth order (x~^ = 0). By Eqns. 32 and 
35 this limit leads to 2:4 = ^ \EaS\ = and 2:7 = ^ [£'a'5'*] = 0, respectively, which make sense because 



at infinite catalytic rate ^3 the complexes EaS and EaS* have zero lifetime. Although the remaining seven 
equations still do not lead easily to a single equation for 0max, it is possible to derive an expression for 0niax 
directly in the linear regime. 



The linear regime implies that the parameters 77 and v are small (Eqn. 22 1; in terms of these parameters, 
the remaining seven equations read: 

(40) 
(41) 
(42) 
(43) 
(44) 
(45) 
(46) 

Furthermore, since the linear regime is defined by the fact that both substrate and enzyme concentrations are 



2:12:2 


= ^25, 


2:62:3 


= 2:5, 


212:6 


= Pxs, 


?70max2:3 


= 28, 


V 


= 21, 


av 


= 2:3+^5+2:8, 


V 


= ?70max + 2:2 + X5 +Xe+Xs 



much smaller than K, the dimensionless variables Xi are also small (Eqns. [13 30). We may then recognize 
that Eqns. 40pT and 42p3] imply that 2:5 and xg, respectively, are small to second order. To first order, 
then, X5 — Xs — 0, and 

(47) 
(48) 
(49) 
(50) 
(51) 



XIX2 


= /326X3, 


xiXe 


= ^??0max2:3, 


V 


= Xi, 


av 


= 2:3, 


V 


= ?70max + X2 + X6 
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where Eqns. 47 and 
Eqns. [47][5T] for ^^a: 



come from combining Eqns. 40]|4T and 42]|43 respectively. It is now trivial to solve 
yielding 

0max — Z I TT"; 27)2' ("'^^ 

1 + ap + a'^p'^ 



Upon comparing this expression with that for the single-modification network (Eqn. 26 1, we see that the 
maximal output for the double-modification network is suppressed by an additional term in the de- 

nominator. Indeed, with respect to the deactivating enzymes, if the activating enzymes are fewer (a > 1) or 
associate more weakly to the substrate (/? > 1), suppression of the output is severe in the linear regime. 

Additionally, the result that and xg are small to second order implies that [EdS*] and [EdS**\ are 
much smaller than [Ed\. To leading order, then, [-B^It ~ [Ed]i i-e. in the linear regime the deactivating 
enzymes are approximately all free, even at maximal input. Indeed, this feature is a primary difference 
between the two sensitivity regimes: in the zero-order regime either the activating or deactivating enzymes 
are saturated by the substrate, while in the linear regime both activating and deactivating enzymes are free. 
The implications of this freedom for signaling are discussed at several points in the main text. 



C Rebinding time distribution in one dimension 



In this appendix, we consider the problem of a particle diffusing in a one-dimensional space that is free on 
one side and has a radiation boundary condition on the other. The distribution of first-passage times at 
the boundary is directly obtainable from the Green's function for this problem, which is known. The result 
provides a good approximation for the distribution of rebinding times to a planar membrane populated with 
absorbing constituents, for excursions long enough that the plane appears as a uniform, semi-absorbent sink. 

We consider a particle diffusing along the positive z-axis with a radiation boundary at z = 0. The 
diffusion equation describes the evolution of the probability density p{z\t): 



dp[z\t) 
dt 



= D 



d^p{z\t) 



(53) 



The radiation boundary condition states that the flux leaving the boundary is due to a reaction, which 
requires both that the particle is at the boundary, with probability p{Q\t), and that the reaction fires, with 
intrinsic rate k (dimensions length per time): 



D 



dp{z\t) 
dz 



= fcp(0|<). 



z=0 



The solution given that the particle starts at point zq, i.e. 

p(z|0) = 5(z-zo), 
which is called the Green's function, is known 1331 : 



(54) 



(55) 



p{z\t,ZQ) 



1 



V47rD< 



\.-^(z-zof/(ADt) ^-{z+z„f liADt) 



^^^kH/D^kiz+zo)/D^^^^ ( i±i£ + 

D \ JiDt V D 




(56) 



The first term is the solution for a reflecting boundary, and the second term describes the loss of probability 
incurred by the reaction; erfc denotes the complementary error function. 

The distribution of first-passage times through the boundary P{t\zo) is equal to the flux out of the 
boundary at time t, which by Eqn. [54] is 



P{t\zo)^kp{0\t,zo). 



We obtain p{0\t, zq) directly from Eqn. 56 yielding 



Pit\zo) 



k 



zl/iiDt) _ ^ kH/D^kzo/D^^^^ 

D 




(57) 



(58) 
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We specialize to the distribution of rehinding times r 
boundary, zo — 0. Eqn. 58 then becomes 



t by demanding that the particle starts at the 



P{r) 



d' 



'/^erfc 




e''/''perfc ( \/r/rp] 

r/r„ V ^ / 



(59) 



(60) 



In the second line we have recognized that reaction and diffusion define a characteristic timescale Tp = D/k'^. 
The meaning of this timescale is elucidated by considering times much shorter or longer than Tp, as described 
below. 



At short times (r <C Tp), the second term in Eqn. 60 is unity to leading order, and the first term dominates, 
producing a r~^/^ scaling: 

1 



r <^ Vr, 



(61) 



Such short times correspond to a collision-dominated regime: the particle does not diffuse appreciably far 
from the boundary; instead, it makes quick bounces against the boundary, getting refiected until ultimately 
becoming absorbed. 

The above intuition can be sharpened in two ways. First, we may quantify the notion of "appreciably 
far" by realizing that reaction and diffusion define a characteristic length d = D/k. The speed at which a 
particle travels this length in time Tp is d/rp = k, revealing that the reaction rate k may also be interpreted 
as the mean velocity at which particles are "pulled" into the boundary. Particles diffusing farther than d 
escape this radiative pull, while particles remaining within d stay close to the boundary until absorbed. 

Second, we may appeal to Bayes's rule to understand the scaling in Eqn. |61| Supposing that a short 
excursion is comprised of a number of unsuccessful reflections, ultimate absorption requires the radiation 
reaction to fire given that the particle is at the boundary {z = 0). The probability of this event occurring at 
time r can be written using Bayes's rule as 



p{r\z = 0) = 



p{z = 0\r)p{r) 
P{z = 0) 



oc p{z = 0|r)p(r), 



(62) 



where p{z = 0) = drp{z = 0\r)p{r) normalizes the distribution and is independent of time. The first 
term on the right, for a reflecting particle, is equivalent to the free-particle solution in one dimension: 
p{z = 0\r) — (47rDr)~-'^/^e~'^°^ ^'^-^^ oc r^^^^. The second term is described by an exponential waiting time 



distribution, whose time constant must be given by the only timescale in the problem, r 
For r <^ Tp, p(r) « is constant to leading order, and p{r\z = 0) oc r~^^'^ , as in Eqn. 
At long times (r 3> ^p), the erfc in Eqn. 60 can be approximated by its asymptotic 



,: p{r) 



61 



imit, yielding 



P(r) 




1 •3-5 -...(271-1) 

(2r/rp)" 



(63) 
(64) 



to leading order. Such long times correspond to a search-dominated regime: the time spent far from the 
boundary diffusing is much greater than the time spent close to the boundary making short reflections. 
The process is therefore well approximated by a one-dimensional random walker returning to an absorbing 
origin, which scales as r~^^'^ . Indeed, explicitly imposing an absorbing boundary by taking fc — > oo makes 
the crossover time Vp 0, such that the distribution scales as r^'^/^ for all times. 

In the main text, we rescale r by the characteristic time to diffuse a molecular diameter ^, yielding the 
dimensionless rebinding time p = r/{i'^/D) and the associated crossover time pp = rp/{£'^/D) = (D/ik)'^. 
The and p^'^/^ scalings, as well as the crossover time pp, are observed in Fig. |4]of the main text in the 

planar regime, in which a substrate molecule diffuses far enough from the membrane that the problem can 
be approximated as one-dimensional, but not so far that it encounters the reflective boundary opposite the 
membrane. 
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